An Empirical Study on Class-Based Word Sense Disambiguation
نویسندگان
چکیده
As empirically demonstrated by the last SensEval exercises, assigning the appropriate meaning to words in context has resisted all attempts to be successfully addressed. One possible reason could be the use of inappropriate set of meanings. In fact, WordNet has been used as a de-facto standard repository of meanings. However, to our knowledge, the meanings represented by WordNet have been only used for WSD at a very fine-grained sense level or at a very coarse-grained class level. We suspect that selecting the appropriate level of abstraction could be on between both levels. We use a very simple method for deriving a small set of appropriate meanings using basic structural properties of WordNet. We also empirically demonstrate that this automatically derived set of meanings groups senses into an adequate level of abstraction in order to perform class-based Word Sense Disambiguation, allowing accuracy figures over 80%.
منابع مشابه
Exploratory Study of Word Sense Disambiguation Methods for Verbs in Brazilian Portuguese
Word Sense Disambiguation (WSD) aims at identifying the correct sense of a word in a given context. WSD is an important task for other applications as Machine Translation or Information Retrieval. For English, WSD has been widely studied, obtaining different performances. Analyzing by morphosyntactic class, Verb is the hardest class to be disambiguated. Verbs are an important class and help to ...
متن کاملA Simple Approach to Building Ensembles of Naive Bayesian Classi ers for Word Sense Disambiguation
This paper presents a corpus-based approach to word sense disambiguation that builds an ensemble of Naive Bayesian classi ers, each of which is based on lexical features that represent co{occurring words in varying sized windows of context. Despite the simplicity of this approach, empirical results disambiguating the widely studied nouns line and interest show that such an ensemble achieves acc...
متن کاملClass-based collocations for Word Sense Disambiguation
This paper describes the NMSU-Pitt-UNCA word-sense disambiguation system participating in the Senseval-3 English lexical sample task. The focus of the work is on using semantic class-based collocations to augment traditional word-based collocations. Three separate sources of word relatedness are used for these collocations: 1) WordNet hypernym relations; 2) cluster-based word similarity classes...
متن کاملAn Ensemble Approach to Corpus Based Word Sense Disambiguation
This paper presents a corpus{based approach to word sense disambiguation that combines a number of Naive Bayesian classiers into an ensemble that performs disambiguation via a majority vote. Each of the member classiers is based on collocation and co{occurrence features found in varying sized windows of context. This approach is motivated by the observation that, in general, enhancing the featu...
متن کاملAn Empirical Study of the Domain Dependence of Supervised Word Sense Disambiguation Systems
This paper describes a set of experiments carried out to explore the domain dependence of alternative supervised Word Sense Disambiguation algorithms. The aim of the work is threefold: studying the performance of these algorithms when tested on a di erent corpus from that they were trained on; exploring their ability to tune to new domains, and demonstrating empirically that the LazyBoosting al...
متن کامل